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1.  Introduction 

Fluctuations  are  always  present  in  underwater  sound  propagation,  and  are  generally  viewed 
as  a  complication  in  the  signal  identification  and  extraction  process.  However,  in  some  important 
cases  where  the  signals  fluctuate  less  than  the  noise,  it  is  possible  to  exploit  the  different  natures  of 
the  signal  and  noise  fluctuations  to  achieve  a  greater  signal-to-noise  ratio  (S/N).  Wagstaff  s 
Integration  Silencing  Processor  (WISPR)  [1-4]  is  one  example  of  a  non-linear  processor  which  takes 
advantage  of  the  relatively  larger  fluctuations  of  the  noise  compared  to  those  of  the  steadier  signal,  to 
provide  S/N  gains  and  other  enhancements.  The  WISPR  processing  technique  has  been  successfully 
apphed  to  several  experimental  situations. 

The  complex  pressures  /  power  values  which  form  the  data  set  from  which  average  WISPR 
levels  and  average  power  levels  are  calculated,  are  typically  obtained  from  Fourier  transforms  of 
hydrophone  output  voltage  time  series .  Usually  the  time  series  values  in  the  finite  window  are 
multiplied  by  a  window  function  such  as  Gaussian  or  Hann.  Such  a  transform  has  been  called  a 
"short-time  Fourier  transform”  (STFT)  [5].  For  example,  the  Gabor  transform  which  has  a 
Gaussian  window  function  can  be  written  as 

-f  CO  2 

S(f,  X)  =  J  dt .  s(t) .  exp  -  y( 

While  the  width  of  the  time  series  window  in  the  STFT  is  independent  of  frequency  (a  in  Eq.  (1) 
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being  a  constant),  other  transforms  in  which  this  width  vary  inversely  proportional  to  frequency 
have  been  proposed  and  are  being  studied  extensively  [5].  For  example,  the  Morlet  transform  can 
written  as  [6] 


+  PO 

S(f ,  t)  =  Vf  J  dt .  s(t) .  exp 


—  oo 


1 

2 


21 


.  exp  (i27tft) 

9 


(2) 


be 


where  ^  is  a  constant.  This  latter  kind  of  transforms  have  been  applied  to  time-frequency  analyses  to 
produce  improved  resolution  [7, 8].  This  is  illustrated  in  Fig.  1,  which  compares  the  results  of  time- 
frequency  analyses  using  the  STFT  and  the  Morlet  transforms  [7].  In  the  example  of  Fig.  1,  the 
Morlet  transform  produces  improved  resolution  compared  to  the  STFT. 

Since  WISPR  processing  involves  the  calculation  of  several  types  of  averages  of  complex 
pressure  /  power  values  over  periods  of  time,  the  improved  resolutions  due  to  wavelet  transforms 
such  as  those  in  Refs.  7  and  8  could  possibly  improve  WISPR  performance.  In  such  applications  of 
wavelet  transforms  to  WISPR  processing,  the  frequency  dependence  of  the  width  of  the  time 
window  would  be  an  important  factor.  The  variable  width  can  affect  WISPR  performance  in  two 
ways: 


(a)  A  smaller  time  window  entails  a  larger  frequency  bin  width,  greater  noise  in  the  signal 
bin,  and  consequently  a  possible  deterioration  in  WISPR  performance. 

(b)  Even  if  the  signal  width  in  frequency  space  is  less  than  the  FFT  /  DFT  bin  width,  varying 
the  time  window  width  can  cause  a  portion  of  the  signal  energy  to  fall  in  a  bin  adjacent  to  the  main 
signal  bin.  The  effect  of  this  on  WISPR  performance  needs  to  be  investigated. 

A  discussion  of  these  factors  is  the  subject  of  the  present  report.  The  investigation  has  been 
conducted  using  the  example  of  a  successful  application  of  the  WISPR  processor  in  the  detection  of  a 
submerged  source.  That  example  is  briefly  described  in  Section  2.  The  effect  of  varying  the  width 
of  the  time  window  is  discussed  in  Section  3.  The  sensitivity  of  the  WISPR  processor  to  the 
position  of  the  signal  within  its  FFT  /  DFT  frequency  bin  is  discussed  in  Section  3. 


2 


2.  WISPR  Example 

The  example  of  a  successful  application  of  the  WISPR  processor  in  the  detection  of  a 
submerged  source  is  described  in  the  present  section,  along  with  the  relevant  theoretical  details.  This 
example  will  be  used  in  the  investigations  described  below.  The  version  of  the  WISPR  processor 
used  here  is  one  that  includes  phase  variations  [4].  In  order  to  briefly  describe  this  processor, 
consider  the  complex  pressures  for  a  given  frequency  bin  calculated  by  an  FFT  from  a  time  series  of 
hydrophone  voltages.  These  complex  pressures,  which  also  form  a  time  series,  may  be  either  the 
ouq)ut  from  a  single  hydrophone,  or  the  beamformed  output  from  an  array  of  hydrophones.  Let  the 
set  {zn}  (n  =  1, 2,  ...,N)  denote  the  complex  pressures  from  a  single  hydrophone  or  from  a  single 
beam.  The  n*  complex  pressure  is  of  the  form 

(3) 


where  rn  is  the  magnitude,  and  Un  is  the  complex  phase  factor.  This  phase  factor  is  of  the  form 
u^  =  exp  {i[6  n  +  27tf(t  +  nAt)]}  ^4^ 

where  f  is  the  frequency,  and  At  is  the  time  delay  between  the  successive  FFTs.  The  phase  factors 
corresponding  to  the  time  delay  At  can  be  removed  from  the  set  {un},  yielding  a  new  set  (vn)  as 

Vn  =  Un  exp  (—  i27cfnAt)  =  exp  [i(0  n  +  27cft)]  ^5^ 

The  average  of  the  set  {vn}  is  given  by 
1  ^ 


Now  defining  a  quantity  R  as 


R  = 


N 


N 

S  r 


n  =  1 


-  1 

-1 

n 


the  WISPR  value  is  taken  to  be 


(7) 
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The  corresponding  average  WISPR  level  in  decibels  is  then 


W^=101og  W. 


(9) 


The  conventional  definition  of  the  average  power  for  the  set  {zn}  is 


1  N  2 

n  =  1 


(10) 


and  the  corresponding  average  power  level  in  decibel  is 

P«=101ogP.  (jj, 

Having  defined  the  WISPR  level  (including  phase  fluctuations)  in  Eq.  (9),  it  is  now 
straightforward  to  calculate  the  function  (Pds  -  WdB).  For  a  beam  containing  a  steady  tonal  such  as 
that  from  a  submerged  source,  the  WISPR  level  W^b  would  be  nearly  equal  to  the  power  level  Pds. 
and  therefore  the  function  (P^b  -  W(Jb)  would  be  nearly  zero  [1-4].  In  contrast,  the  function  (Pjb  - 
W(1b)  would  have  a  relatively  large  positive  value  for  a  noise  beam  [1-4]. 

The  above  concepts  are  illustrated  in  Fig.  2,  showing  results  from  a  Pacific  experiment  which 
employed  a  144  element  hydrophone  horizontal  line  array.  The  beampattem  for  the  signal  in  the 
frequency  bin  5.225  -  5.322  Hz  is  shown  in  Fig.  2  (sampling  rate:  200  Hz;  FFT  size:  2048  points; 
time  series  overlap  between  succssive  FFTs:  75%,  Hann  window).  The  WISPR  level  is 
approximately  15  dB  lower  than  the  average  power  level  on  aU  beams  except  in  the  vicinity  of  beam 
number  118,  which  indicates  a  steady  submerged  source  by  a  sudden  dip  in  the  difference  function 
(PdB  -  W(1b).  It  is  important  to  point  out  that  the  average  power  level  plot  (the  top  curve)  does  not 
indicate  such  an  unambiguous  detection,  because  it  exhibits  an  even  larger  peak  in  the  vicinity  of 
beam  number  139,  produced  by  a  strong  but  fluctuating  surface  source. 


3.  Variable  Time  Window  Width 

The  effect  of  varying  the  width  of  the  time  window  was  studied  by  repeating  the  above 
calculation  using  different  numbers  of  DFT  points  in  the  initial  extraction  of  amplitudes  from  the  time 
series.  As  in  Fig.  2,  all  calculations  reported  below  focus  on  the  signal  in  the  vicinity  of  5.25  Hz, 
and  use  the  Hann  window.  In  Fig.  3,  the  difference  function  (P^b  -  Wjb)  is  plotted  along  the  Y- 
axis,  and  the  plots  show  the  effect  of  decreasing  the  number  of  DFT  size  from  2048  points  to  948 
points,  and  to  722  points.  In  all  three  cases  the  signal  bin  fully  contained  the  signal.  (The 
importance  of  this  point  will  become  clear  by  the  discussions  in  Section  4).  As  the  DFT  size 
decreases,  the  width  of  the  signal  bin  correspondingly  increase  from  0.097  Hz  to  0.21 1  Hz,  and  to 
0.259  Hz,  allowing  in  more  noise  and  causing  more  fluctuations.  As  a  consequence,  the  goodness 
of  detection  deteriorated,  but  only  slightly,  as  indicated  by  the  decreasing  dip  in  the  vicinity  of  beam 
number  118.  This  change  is  rather  small,  since  the  difference  function  (PdB  -  Wdfi)  at  beam  number 
1 18  changes  from  approximately  2  dB  (for  2048  DFT  points)  to  approximately  5  dB  (for  722  DFT 
points),  in  comparison  with  higher  values  (approximately  15  dB)  for  noise  beams. 


4.  Signal  Position  Within  FFT  Bin 

The  WISPR  level  has  been  observed  to  be  very  sensitive  to  the  position  of  the  signal  within 
the  FFT  /  DFT  frequency  bin.  In  order  to  demonstrate  this,  we  start  with  Fig.  4  which  shows  the 
average  power  level  of  the  signal  in  the  vicinity  of  5.25  Hz  plotted  vs.  frequency,  obtained  from 

16000  point  DFTs.  The  calculation  of  Fig.  2  was  repeated  with  DFT  sizes  1964, 1966, 1968, . 

,1994, 1996.  Figure  5  shows  the  upper  edges  and  the  lower  edges  (in  frequency  space)  of  the  signal 
bin  in  each  case,  with  the  plus  signs  indicating  the  points  at  which  the  DFT  calculations  were  made. 
The  signal  shape  of  Fig.  4  is  marked  to  scale  on  the  right  hand  side  of  Fig.  5.  The  dashed  lines  in 
Fig.  5  represent  the  optimum  window  location  as  judged  from  Fig.  7  below. 

The  average  power  level  P^Bj  the  standard  deviation  c  of  the  power  levels,  and  the  WISPR 

level  WdB  are  plotted  in  Fig.  6  as  functions  of  frequency  for  DFT  sizes  1964, 1966, 1968, . . 

1994, 1996.  Each  frequency  value  is  the  median  frequency  of  the  signal  bin  for  the 
corresponding  DFT.  The  signal  shape  from  Fig.  4  is  also  shown.  The  signal  bin  width  is 
approximately  0.10  Hz  for  all  the  DFTs,  and  is  shown  labeled  "window  size".  The  average  power 
level  P(iB,  the  standard  deviation  a  of  the  power  levels,  the  WISPR  level  W(1b>  and  the  difference 
function  (PdB  -  WdB)  are  shown  in  greater  detail  in  Fig.  7.  The  point  at  approximately  5.295  Hz 
coiresponds  to  the  1964  point  DFT,  and  the  point  at  approximately  5.21  Hz  corresponds  to  the  1996 
point  DFT.  An  examination  of  the  signal  shape  and  the  signal  bin  width  in  Fig.  6  reveals  that  for  the 
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1964  point  DFT  (5.295  Hz)  and  for  the  1996  point  DFT  (5.21  Hz),  small  portions  of  the  signal 
energy  lie  outside  the  signal  bin.  In  spite  of  this,  the  average  power  levels  at  these  points  differ  only 
less  than  1  dB  from  their  peak  value  which  occurs  at  approximately  5.25  Hz  (Fig.  7).  However,  the 
WISPR  level  is  significantly  affected,  and  it  changes  by  as  much  as  5  dB  from  its  peak  level 
(Fig.  7).  Since  the  smallness  of  the  difference  function  is  the  criterion  for  detection  of  stable  signal 
sources,  it  is  clear  that  WISPR  detection  capability  deteriorates  when  even  a  small  portion  of  the 
signal  energy  falls  outside  the  signal  bin,  as  at  5.295  Hz  and  at  5.21  Hz. 

The  above  observations  are  confirmed  by  the  results  in  Figure  8(a),  which  shows  the  upper 
edges  and  the  lower  edges  of  the  signal  bin  for  DFT  sizes  948, 972, 986,  and  1024.  The  dashed 
lines  represent  the  optimum  window  location  as  judged  from  Fig.  7 .  Clearly  portions  of  the  signal 
energy  for  the  972  point  DFT  falls  outside  the  signal  bin  for  this  case.  Figure  8(b)  shows  that  the 
corresponding  WISPR  level  is  about  14  dB  lower  than  the  other  WISPR  levels  where  the  complete 
signal  energy  falls  within  the  signal  bin.  In  contrast,  this  variation  has  only  a  miniscule  effect  on  the 
average  power. 


5.  Conclusions 

The  foregoing  analysis  demonstrates  that  variable  width  of  the  time  series  window  in  FFT  / 
DFT  calculations  has  only  a  small  effect  on  WISPR  performance,  provided  that  the  full  signal  energy 
is  contained  within  the  signal  bin.  When  the  signal  position  within  the  signal  bin  is  such  that  a  small 
portion  of  the  signal  energy  falls  outside  the  signal  bin,  the  average  power  level  is  only  slightly 
affected.  However,  such  loss  of  even  a  small  portion  of  the  signal  energy  from  the  signal  bin  has  a 
dramatic  effect  on  the  WISPR  level,  and  causes  deterioration  in  WISPR  performance  by  several 
decibels.  Therefore,  in  applications  of  wavelet  transforms  to  WISPR  processing,  special  care  must 
be  taken  to  ensure  that  the  signal  bin  contains  the  full  signal  energy,  and  that  no  portion  of  this 
energy  falls  in  adjacent  bins. 

The  most  effective  ways  to  do  WISPR  processing  accurately  when  the  signal  energy 
necessarily  has  to  fall  in  at  least  two  or  more  frequency  bins  due  to  small  bin  width,  is  a  topic  that 
remains  to  be  investigated. 

The  sensitivity  of  the  WISPR  level  to  the  signal  position  within  the  signal  bin  can  be  used  to 
determine  the  signal  frequency  more  precisely,  avoiding  the  necessity  to  carry  out  calculations  using 
much  larger  data  sets. 
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Figure  Captions 

Figure  1.  Comparison  of  the  results  of  time-frequency  analyses  using  (a)  the  "short-time  Fourier 
transform",  and  (b)  the  Morlet  transform  (from  Ref.  7). 

Figure  2.  The  beampattem  for  the  signal  (including  noise)  in  the  frequency  bin  5.225  -  5.322  Hz 
from  a  Pacific  experiment  employing  a  144  hydrophone  horizontal  line  array  (sampling  rate: 
200  Hz;  FFT  size:  2048  points;  Hann  window).  The  average  power  level,  the  WISPR  level, 
and  the  difference  between  the  two  are  labeled. 

Figure  3.  The  effect  of  varying  the  size  of  time  window  on  WISPR  performance.  The  difference 
function  (PdB  -  Wds)  is  plotted  along  the  Y-axis. 

Figure  4.  The  average  power  level  of  the  signal  in  the  vicinity  of  5.25  Hz  plotted  vs.  frequency, 
obtained  from  160(X)  point  DFTs. 

Figure  5.  The  upper  edges  and  the  lower  edges  of  the  signal  bin  for  DFT  sizes  1964, 1966, 1968, 

. ,  1994,  1996.  The  plus  signs  indicate  the  points  at  which  the  DFT  calculations  were 

made.  The  dashed  lines  represent  the  optimum  window  location  as  judged  from  Fig.  7. 

Figure  6.  The  average  power  level  P(JB»  the  standard  deviation  o  of  the  power  levels,  and  the 

WISPR  level  WdB  plotted  as  functions  of  frequency  for  DFT  sizes  1964, 1966, 1968, . . 

1994, 1996.  Each  frequency  value  is  the  median  frequency  of  the  signal  bin  for  the 
corresponding  DFT.  The  plus  signs  indicate  the  points  at  which  the  DFT  calculations  were 
made. 

Figure  7.  The  average  power  level  P(jB>  the  standard  deviation  o  of  the  power  levels,  the  WISPR 

level  WdB,  and  the  difference  function  (PdB  -  WdB)  plotted  as  functions  of  frequency  for  DFT 

sizes  1964, 1966,  1968, . ,  1994, 1996.  Each  frequency  value  is  the  median  frequency 

of  the  signal  bin  for  the  corresponding  DFT.  The  plus  signs  indicate  the  points  at  which  the 
DFT  calculations  were  made. 

Figure  8(a):  The  upper  edges  and  the  lower  edges  of  the  signal  bin  for  DFT  sizes  948, 972, 986, 

and  1024.  The  plus  signs  indicate  the  points  at  which  the  DFT  calculations  were  made.  The 
dashed  lines  represent  the  optimum  window  location  as  judged  from  Fig.  7.  Fig.  8(b):  The 
average  power  level  PdB,  the  standard  deviation  a  of  the  power  levels,  and  the  WISPR  level 

WdB,  for  DFT  sizes  948,  972, 986,  and  1024. 
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